Typesafe Modeling in Text Mining
Author
Abstract
    class SensevalData(s: String) extends Agent[String, Context] {
      val file = new File(s)
      val trainDataReader = new STAXSensevalDataReader(file)
      val samples: List[Ambiguity] = trainDataReader.getAmbiguities.toList
      val words: java.util.List[String] = trainDataReader.getWords
      def process(input: java.util.List[Annotation[String]]) = samples.map(asAnnotation(_))
      def asAnnotation(amb: Ambiguity): Anno[Context] = {
        Annotations.create[Context](classOf[SensevalData], amb.getContext: Context,
          amb.getContext.targetStart: Int, amb.getContext.targetEnd: Int)
      }
      override def toString = file.toURL.toString
    }

SensevalSense: agent for accessing the target words of the data (in Scala, Agent[Context, Ambiguity]):

    class SensevalSense(s: String) extends Agent[Context, Ambiguity] {
      val words: List[String] = Nil

66 A definition of text mining from http://en.wikipedia.org/wiki/Text_mining
Similar resources
Topic Modeling and Classification of Cyberspace Papers Using Text Mining
The global cyberspace networks provide individuals with platforms to interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and much more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...
A review of text mining approaches and their function in discovering and extracting a topic
Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling. Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researchers to use...
A model for extracting information from text documents, based on text mining, in the e-learning domain
As computer networks become the backbones of science and the economy, enormous quantities of documents become available. Text mining techniques have therefore been used to extract useful information from textual data. Text mining has become an important research area that discovers unknown information, facts, or new hypotheses by automatically extracting information from different written documents. T...
Text mining in computational advertising
Computational advertising uses information on web browsing activity and additional covariates to select pop-up advertisements to display to the user. The statistical challenge is to develop methodology that matches ads to users who are likely to purchase the advertised product. These methods involve text mining but may also draw upon additional modeling related to both the user and the advertis...
Journal title:
- CoRR
Volume abs/1108.0363, Issue
Pages -
Publication date 2011